Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
そして | 2682 | 75 | 1 | 75.0000 |
くれる | 1413 | 47 | 1 | 47.0000 |
しかし | 3681 | 46 | 1 | 46.0000 |
こうした | 1401 | 100 | 3 | 33.3333 |
示す | 697 | 31 | 1 | 31.0000 |
そこで | 1285 | 29 | 1 | 29.0000 |
いる | 61257 | 170 | 6 | 28.3333 |
受ける | 589 | 28 | 1 | 28.0000 |
目指す | 946 | 28 | 1 | 28.0000 |
ウド | 557 | 27 | 1 | 27.0000 |
000 | 1871 | 51 | 2 | 25.5000 |
しまう | 2300 | 49 | 2 | 24.5000 |
いく | 4015 | 78 | 4 | 19.5000 |
さらに | 4932 | 141 | 9 | 15.6667 |
迎える | 307 | 14 | 1 | 14.0000 |
Fi | 304 | 14 | 1 | 14.0000 |
めぐる | 226 | 14 | 1 | 14.0000 |
◇ | 586 | 26 | 2 | 13.0000 |
持つ | 1564 | 52 | 4 | 13.0000 |
超える | 574 | 25 | 2 | 12.5000 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
すれ | 1936 | 1 | 116 | 0.0086 |
だっ | 7556 | 4 | 406 | 0.0099 |
ませ | 5232 | 1 | 93 | 0.0108 |
しよ | 1435 | 1 | 83 | 0.0120 |
なけれ | 1746 | 1 | 77 | 0.0130 |
ましょ | 1963 | 1 | 62 | 0.0161 |
まし | 9207 | 3 | 182 | 0.0165 |
なかっ | 4480 | 3 | 150 | 0.0200 |
だろ | 3658 | 2 | 92 | 0.0217 |
さ | 49831 | 29 | 1194 | 0.0243 |
でし | 1547 | 2 | 78 | 0.0256 |
ごと | 1292 | 3 | 108 | 0.0278 |
し | 137461 | 78 | 2354 | 0.0331 |
ならでは | 437 | 1 | 30 | 0.0333 |
容疑 | 2930 | 7 | 199 | 0.0352 |
ものの | 1086 | 1 | 26 | 0.0385 |
です | 23994 | 25 | 559 | 0.0447 |
。 | 294076 | 47 | 957 | 0.0491 |
ため | 13882 | 13 | 241 | 0.0539 |
でしょ | 3345 | 4 | 68 | 0.0588 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II